Distinguishability Measures and Entropies for General Probabilistic Theories 
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As a part of the construction of an information theory based on general probabilistic theories, 
we propose and investigate the several distinguishability measures and "entropies" in general prob- 
abilistic theories. As their applications, no-cloning theorems, information-disturbance theorems are 
reformulated, and a bound of the accessible informations is discussed in any general probabilistic 
theories, not resorting to quantum theory. We also propose the principle of equality for pure states 
which makes general probabilistic theories to be more realistic, and discuss the role of entropies as 
a measure of pureness. 



I. INTRODUCTION 

Recent development of the quantum information the- 
ory has shown us the ability of information processings 
and computations based on the quantum physics can go 
far beyond those based on classical physics. At its heart, 
this is because the potential ability of a probability is en- 
larged from classical theory to quantum theory. Indeed, 
quantum theory can be considered as a probabilistic the- 
ory, which — in some sense — properly includes the clas- 
sical probability theory (Kolomogorov's probability the- 
ory). However, this does not mean that quantum theory 
is the most general theory of a probability even among 
the possible theories which have an operational mean- 
ings. So far, the most general theory of a probability 
with a suitable operational meanings has been developed 
by several researchers (See for instance (HiSSIljIliSH)- 
Following the recent trend, we call such theories the gen- 
eral probabilistic theories (or simply GPTs). 

As the quantum information theory has been con- 
structed based on the quantum theory, information theo- 
ries can be constructed based on each probabilistic theory 
0, i, i, EH, [H, m, H • There are several motivations 
for this line of researches: First, this is an attempt to 
find physical principles (axioms written by physical lan- 
guages) for quantum theory [l^, [l^, [l3| . Indeed, by con- 
sidering the general framework which encompasses the 
quantum theory, we look for principles which determine 
the position of the quantum theory in this general frame- 
work. The development of the quantum information the- 
ory motivate us to find the principles based on infor- 
mation processings for the theory of quantum physics 
[13, [H, [3- Second, the construction of the information 
theory based on the most general theory of probability 
enables us to understand logical connections among in- 
formation processings by resorting to the particular prop- 
erties of neither classical nor quantum theory, but only to 
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the essential properties which a suitable probability the- 
ory should possess. Third, this is a preparation for the 
possible break of quantum theory. For instance, one can 
discuss a secure key distribution in the general frame- 
work without assuming quantum theory itself (20| . Fi- 
nally, this might provide a classical information theory 
under some restrictions of measurements, since any gen- 
eral probabilistic theories has a classical interpretation 
based on such restrictions of measurements 0, [2l| . 

In this paper, we propose and give systematic discus- 
sions of several distinguishability measures (especially, 
Kolmogorov distance and fidelity) and three quantities re- 
lated to entropies for general probabilistic theories. The 
corresponding measures and entropies in classical and 
quantum theory have been proved to be useful [l^, [25j , 
and we give generalizations for them in any GPTs and 
discuss their applications. In particular, no-cloning the- 
orem and a simple information-disturbance theorem in 
GPTs are reformulated using fidelity, and a bound of 
the accessible information is discussed based on one of 
the "entropies" . Finally, we introduce and formulate the 
principle of "equality of pure states" meaning that there 
arc no special pure states. We call such GPT symmetric 
and in symmetric GPT, the measure of pureness will be 
discussed. 



II. GENERAL PROBABILISTIC THEORIES 

In this section, we give a brief review of general proba- 
bilistic theories (See for instance 0,0, 0,01 and references 
therein for details.) Although, in the end, we are going to 
use mathematical notions such as convexity, affine func- 
tions, etc., it should be noticed that we do not assume 
any mathematical structure without physical reasons. 

The important ingredients of the GPTs are the notions 
of state and measurement. In any GPT, we have a phys- 
ical law to determine a probability p{a\M,s) to obtain 
an output a by a measurement M of an observable un- 
der a state s. In this paper, for simplicity, we only treat 
a measurement with a finitely many outcomes. Natu- 
rally, we assume the separating properties of both states 
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and measurements: (Al) States si and S2 are identified 
if p{a\M, si) = p{a\M,S2) for any measurement M and 
measurement outcome a; (A2) Measurement Mi and M2 
are identified if p{a\Mi, s) = p{a\M2, s) for any measure- 
ment outcome a under any state s. We also assume tlie 
convex property of states; (A3) For any states si, S2 and 
q G [0, 1], tficrc exists the state s to prepare si with prob- 
ability q and S2 with probability 1 — g; namely, it follows 
that p{a\M, s) = qp{a\M, si) -I- (1 — q)p{a\M, S2) for any 
measurement; (A4) Further, we naturally assume that 
the dynamics preserves this probabilistic mixtures; (A5) 
We introduce a natural topology on the state space which 
is the weakest topology such that s p{a\M, s) is con- 
tinuous for any measurements; Finally, we assume (A6) a 
joint state to of system A + B defines a joint probability 
for each measurements Ma and Mb which satisfies the 
no-signaling condition, i.e., the marginal probabilities for 
the outcomes of a measurement on A do not depend on 
the measurement choices on B, and vice versa. More- 
over, the joint state is determined by joint probabilities 
for all pairs of measurements of A and B. 

Based on these, one can show the followings P, [1,[1,0|: 

(a) There exists a locally convex topological vector 
space V such that, in a suitable representation, the state 
space S is a convex subset in V where qsi + {l~q)s2 corre- 
sponds to the state described in (A3) above. An extreme 
point of S is called a pure state. Moreover, without loss 
of generality, one can assume that S is compact with a 
natural topology [l^. Notice that by the famous Krein- 
Milman theorem (see, for instance, Theorem 10.4 in (U) 
the set of extreme points Spure is non-empty and S is 
the closed convex hull of extreme points. In particular, 
in finite dimensional cases, any state s d S has a convex 
decomposition with finite numbers of pure states (here- 
after, a pure state decomposition): s = J2xP^^^ where 
Px > O1 y^.^Pr. = l,Sx G Spure (scc, for instancc, Theo- 
rem 5.6 in [23|). 

A map / : 5 ^ R is called an affine functional if it 
satisfies f{qsi + (1 - 17)52) qf{si) + (1 - q)f{s2) for any 
q e [0, l],si,S2 e S- In particular an afhne functional 
e : iS ^ M is called an effect if the range is contained 
in [0,1]. We denote the sets of all the afhne functional 
and all the effects by AiS) and £{S), respectively. It is 
easy to see that £{S) is a convex subset of a real vector 
space A{S)- We call an extreme effect a pure effect. The 
zero effect and unit effect u such that 0(s) = and 
u{s) = 1 are trivially pure effects. It is easy to see that 
effect u — e is pure iff effect e is pure. Moreover, we 
can introduce a natural topology on £{S) which is the 
weakest topology such that the map £{S) R, e e(s), 
becomes continuous for every s e 5- One can that £{S) 
is compact with respect to this topology fl^ . 

(b) It is often convenient to characterize a measure- 
ment without explicitly specifying the measurement out- 
comes. In that case, any measurement M is character- 
ized by the set of effects rrii such that p{ai\M, s) = mi{s) 
and mi = u: In the following, we occasionally use 
the notation M = (implicitly assuming conditions 



Tnj G £{S) and ruj = u) to denote the measurement 
on S meaning that rUj (s) is the probability to obtain j th 
output (say aj) by a measurement M under a state s. 

(c) Dynamics is described by an affine Junction on state 
space. In general, the initial state space S and final state 
space S' might be different. Then, a time evolution map 
is given by an affine map / from S to S' . We denote by 
A{StS') the set of all the affine map from S to S' . 

(d) The joint systems are described by a convex set 
in a tensor product of the corresponding vector spaces. 
A joint state lo ox\ A + B with state spaces Sa and Sb 
is described by a bi-affine map on £{Sa) x £{Sb)- In 
particular, if uj is a joint state on A-\-B, then the marginal 
state of A is defined by WA(e) := uj{e,UB) (e £ £{Sa)) 
where is the unit effect on Sb- From the extreme 
property of pure states, it is easy to see [111 that if the 
marginal state loa is pure, then a joint state w is a state 
with no correlations: ^^(6^1,63) = LUA{eA)ij-'B{sB) {s-A € 
£{SA),eB e£{SB))- 

It is important to notice that all the mathematical 
structures are not introduced ad hoc but they appear 
naturally based on physical assumptions (A1-A6). It is 
also possible to formulate the measurement process by 
considering the cone generated by and 5 in F [3.[5l.[l0|. 

In this paper, we treat for simplicity finite GPT where 
V is finite dimensional, but most of the definitions and 
properties below holds with some topological remarks 
[22( 1. (However, notice that in finite dimensional cases, 
there are essentially the unique topology, and one can 
use another characterization of the natural topology, for 
instancc using the Kolmogorov distance below. In par- 
ticular, the unique topology is the Euclidean topology 
and thus one can imagine a state space of each GPT as 
any compact convex (or equivalently closed bounded con- 
vex) subset in Euclidean spaces.) Moreover, we assume 
that any set of effects rui such that — ''^ has a 

correspondent measurement. (It is also easy exercises to 
reformulate below without this assumption.) 

Here, let us see the typical examples for finite GPTs. 

[Finite Classical Systems] Let = {cui, . . . ,ujd} be a 
sample space. The state is represented by a probability 
Pi for an elementary event LUi. The state space is given by 
Sc {p e \pi > 0, Ef=iK = !}■ There are d num- 
bers of pure states, which are the definite states where 
one of the elementary event occurs with probability 1: 
Namely, p('') = (<5^i, . . . , S^^d) € Sc {l^ = I, ■ ■ ■ ,d)B No- 
tice that Sc is a (standard) simplex. In particular, any 
state p = (pi, . . . ,pd) € Sc has the unique pure state 
decomposition: P = X]^=i Pa'P''^''- 

[Finite Quantum Systems] Let 7i be a d dimensional 
complex Hilbert space. A quantum state is represented 
by a density operator p, i.e., a positive operator on 7i 
with unit trace. The state space is given by Sq = {p & 
C{Ti) I p > 0,trp = 1} where C{Ti) is a real vec- 
tor space of all the (Hermitian) operator on 7i- Pure 
states are characterized by 1 dimensional projection op- 
erators. A quantum effect e is represented by an operator 
E satisfying < i? < I, called a POVM (positive op- 
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erator valued measure) element, by the correspondence 
e{p) = tT{pE). Here 0,1 denote the zero and identity 
operator on TY. In particular, any measurement of an 
observable M = {m.i)i where rrii are effects on Sq has 
the correspondent POVM measurement {Mi)i such that 
e Cin), Mt > 0, J2^M^ = I and m^{p) = tr(pA/j). 
Notice that the set of all the extreme effects is the set of 
all the projection operator ViTi.)- The POVM measure- 
ment {Pi)i consists of projection operators Pi is called 
a PVM (projection valued measure) measurement. The 
following is an example of GPT which is neither classical 
nor quantum: 

[Hyper Cuboid Systems and squared system] Let 
Scb ■■= {c € R'^IO < c, < l{i = l,...,(i)}. The pure 
states are 2'^ numbers of vertexes. We call this hyper 
cuboid system and especially the squared system when 
d = 2 [111 . These might be the easiest examples of GPT 
which are neither classical nor quantum. However, one 
can construct a classical model such that a suitable re- 
striction of measurements reduces the hyper cuboid sys- 
tems dn. 

Finally, notice that the probabilistic theories with state 
spaces Sa and Sb a-i'c equivalent if they are affine isomor- 
phic, i.e., there exists a bijective affine map from Sa to 
Sb- For instance, any GPT which has a simplex state 
space is affine isomorphic to some standard simplex, and 
therefore can be considered as a classical system. 



III. DISTINGUISHABILITY MEASURES FOR 
GENERAL PROBABILISTIC THEORIES 

In this section, we introduce several distinguishability 
measures (Kolmogorov distance. Fidelity, Shannon dis- 
tinguishability etc) for GPTs. The corresponding mea- 
sures for quantum systems are proved to be useful in 
quantum information theories [24| . It is indeed straight- 
forward to generalize them to any GPT using the no- 
tions developed in the preceding sections, and some of 
them has been used in references [l3. El, [13 ■ Most 
of the properties for quantum systems preserves to be 
hold including the ways to prove them [24| . However, we 
think it useful to sum up these measures, especially Kol- 
mogorov distance and fidelity, for GPT systematically 
and all the proofs of this section are put in Appendix |^ 
for the reader's convenience. A striking thing is that all 
the below results does not resort to ingredients such as 
vectors and operators on a Hilbert space, but only to the 
analysis of probabilities. 

All the measures below are based on those for classical 
systems among every possible measurements of observ- 
ables: In the following, let S be the set of states (state 
space), and £ = £{S), M = M{S) be the sets of effects 
and measurements on S- 



A. Kolmogorov Distance in GPT 

The Kolmogorov distance Dc{p, q) is known to serve 
as a good distinguishability measure between two prob- 
ability distributions p = (j>i)i and q = {qj)j '■ 

Dc{p,q) ■■= \ j2\P'~ 

i 

Indeed, Dc has a metric property and it follows that 
Dcipi, qj) ~ max5 \'p(S) — q(S)\ where the maximization 
is taken over all subsets S of the index set {z}. Thus 
Dc{p, q) is considered as a metric for two probability dis- 
tributions with an operational meaning. 

In any GPT, one can define [31 the Kolmogorov dis- 
tance between two states si, S2 S 5 by 

D{si,S2):= max D ,{pi{M) , p2{M)) , (I) 

A/=(mi)eX 

where pi{M) := {mi{si))i and P2{M) := {mi{s2))i are 
probability distributions to get ith output of the mea- 
surement M under states si and S2, respectively. The 
maximization in ([T]) is always attained by some mea- 
surement, which we call an optimal measurement, due 
to the compactness of the effect set [l^. Notice that 
D{si,S2) is a metric of S, i.e., (i) D{si,S2) > ; 
equality iff si — S2, (ii)-D(si, S2) ~ D{s2,si), and (iii) 
D{si, S3) < D{si, S2)+D(s2, S3), and it is bounded above 
from 1, i.e., < £'(si,S2) < 1. These follow from 
a metric property of Dc and a separation property of 
states. The above mentioned operational meaning of Dc 
also gives D{p^ a) an operational meaning; that is the 
maximum difference of probability among all the event 
S and all measurements. In quantum systems, D{pi,p2) 
is the trace distance between density operators pi, P2'- 
D{p^,P2) = \iT\pi- P2\^ where \A\ -.^ ^/WA. 

For any measurement M = {mi) and states si,S2 € 
S, one can consider a two valued measurement M2 = 
(?Ti_|_,m_) where ?7i_|_ := ^^"-^ 

"^liex- '^it^ ^+ I "iT^ii^i) ~ ™'i{s2) > 0} and 

X_ :~ {i I TOi(si) — mi(s2) < 0}. Using this, one has 
another characterization of the Kolmogorov distance: 

i5(p,(T)=max(e(p)-e(a)). (2) 

The quantity in the right-hand side is a metric used in 

Let P5(si,S2) be the maximal success probability to 
distinguish two states si and S2 in a single measurement 
under the uniform prior distribution. Without loss of 
generality, it is enough to consider two-valued measure- 
ment (mi,TO2) G for a discrimination problem of two 
states si and S2 by guessing si (or S2) when observing 1 
(or 2)th output. Thus, we have 

Ps{si,S2) ■■= max fimi(si) -I- ^7712(52)) 

(mi,m2)GXV2 2 / 

= i(l+max(e(si)-e(s2))). (3) 
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From ([2|) and ([3]), we have another operational meaning 
of the Kohiiogorov distance: 

Proposition 1 For any states si, S2 € S in GPT, 

Note that D{si, S2) takes the maximum 1 iff Ps{si, S2) = 
1, i.e., when si and §2 are completely distinguishable in a 
single measurement. On the other hand, D{si, S2) takes 
the minimum (thus si = S2) iff Psisi, S2) = 1/2, i.e., si 
and S2 are completely indistinguishable (and indeed such 
states should be identified due to the separation property 
of states). 

In the following, we show the monotonicity, strong con- 
vexity, joint convexity, and convexity follow for the Kol- 
mogorov distance in any GPT. 

Proposition 2 (Monotonicity) For any states si,S2 G 
S, and time evolution map A e A{S,S'), we have 

D{suS2) > D{A{si),A{s2)). 

This implies that the distinguishability between si and 
S2 cannot be increased in any physical means. Notice 
that it is well known that the trace distance in quantum 
systems has the monotonicity property under any trace 
preserving completely positive map |25| . Proposition [2] 
generalizes this for any trace preserving positive map. 

Proposition 3 (Strong convexity) Letp — {pi)i andq ~ 
{qi)i be probability distributions over the same index set, 
and Si,ti €z S be states of GPT with the same index set. 
Then, it follows that 

Di^^Pi^i^^li^^) ^ Dc{p,q) + ^PtD{s„ti). 

i i i 

As corollaries, we have 
Corollary 1 (Joint convexity) 

^(y^ PtSj.y^^Pit^ <'^PtD{si,ti). 

i i i 

(As a special case pi = qi of the strong convexity.) 
Corollary 2 (Convexity) 

D{^p,s,,t^ <Y,P^D{s,,t). 

i i 

(As a special case ti — t of the joint convexity.) 

B. Fidelity in GPT 

The Bhattacharyya coefficient (the classical fidelity) 
between two probability distributions p ~ {pi)i and q = 
{qj)j is defined by : 

Fcip,q) ■■=^Vp^- (4) 



Note that (i) < Fc{p,q) < 1 where Fc{p,q) = 1 iff 
p = q; (ii) Fc{p,q) ~ Fc{q,p). We say two probabihty 
distributions p, q are orthogonal iff Fc{p, q) = 0. 

In any GPT, one can also define the fidelity 0, [T^ 
between two states si, S2 G 5 as 

F{si,S2)^ inf FMM),p2{M)), (5) 

M={mi}eM 

where pi{M) (mi(si))i and P2{M) := (TOi(s2))i- 
Contrast to the Kolmogorov distance, the attainability 
of the infimum of the fidelity seems to be nontrivial. In 
quantum mechanics, one has the formula F{pi,p2) ~ 

tr|pJ^^P2^^l = ^iKfh^ P^pY'^y^'^] between two density 
operators pi, p2 [25l.l26l|. Also, it is shown that an optimal 
measurement (POVM) exists which attains the infimum. 

From the property of the Bhattacharyya coefficient 
and the separation property of states, it follows that (i) 
< F{si,S2) < 1 where -F(si,S2) = 1 iff si = S2; (u) 
^"(51,52) = F{s2,si). We say that states si and S2 are 
orthogonal (si _L S2) iff F{si,S2) ~ 0. 

Proposition 4 (Monotonicity) For any states Si,S2 G 
S, and time evolution map A G AiS,S'), it follows 

F(A(si),A(s2)) > F(si,S2). 

Proposition 5 (Strong concavity fl^ l) Letp ~ {pi)i and 
q = {qi)i be probability distributions over the same index 
set, and Si,ti € S be states of GPT with the same index 
set. Then, 

i i i 

As corollaries, one gets 

Corollary 3 (Joint concavity and concavity) 
F(^PtSi,^Pit,'j >^PtF{si,ti), 

i i i 

F(J2P^S^,t) >Y.P^F{s,,t). 

i i 

Proposition Q In a bipartite system A + B, we have the 
followings: 

(i) F{sA,tji) > F{s,t) for any s,t G Sa®Sb where 
SA and tA are the reduced states to the system A. 

(ii) F{si,S2)F{ti,t2) > F{si ^ ti,S2 <^ ^2) for any 
$1, S2 € SA,ti,t2 e Sb- 

(Hi) F(si, S2) — F{si(E)t, S2<E)t) for any si, S2 G SA,t G 
Sb- 

In particular, from (ii), it follows 

F{s,tf > F{s(E)s,t(E)t), (6) 
by letting S Sa ~ Sb and s = si = ti G 5 and 

t = S2 = i2 e 5. 

Note that the generalization of properties of Proposi- 
tion [5] is straightforward for multipartite system. 
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However, contrary to the Kolmogorov distance, it is 
difScult to give an operational meaning for the Fidehty, 
since there is no known operational meaning of Bhat- 
tacharyya coefficient. In using the Fidelity, it is impor- 
tant to know the relation with another operational mea- 
sures like the Kolmogorov distance. 

C. Relation between the Kolmogorov Distance and 
the Fidelity 

Proposition 7 For any state s,t E S, it follows 

1 - t) < D{s, t) < ^1-F{s,t)^. (7) 

This relation is famous to hold in quantum systems (23 . 
[25| . but Proposition[7]shows that this holds for any GPT. 
From ([7]), we have 

Corollary 4 (i) D{s,t) = iff F{s,t) = 1 and (ii) 
D{s,t) ~ 1 iff F(s,t) = 0. In particular, the orthogo- 
nality of states turns out to be equivalent to the complete 
distinguishability of states (Pg — 1). 

In this sense, the Kolmogorov distance and the fidelity is 
equivalent. 

Similarly, it is straightforward to introduce another 
measures which are used in quantum information theory. 
For instance, one can define Shannon distinguishability 
and can show the same relations (see for instance Theo- 
rem 1 in [131). 

IV. APPLICATIONS 

In this section, we give simple proofs using the Fidelity 
for no-cloning theorem and information-disturbance 
theorem [H, in any GPT. 

Theorem 1 (No-cloning) In any GPT, two states 
Si,S2 S S are jointly clonable iff si ~ S2 or si and S2 
are completely distinguishable. 

Proof Let states si,S2 €5 are jointly clonable. Namely, 
there exists a time evolution map (a cloning machine) 
A e A{S, S®S) satisfying 

A(si) = Si® Si, A(S2) = S2® S2- (8) 

From (l6|), we have 

F(A(S1),A(S2)) =F(S1® Si,S2®S2) < F{si,S2f. 

From the monotonicity of F, it follows that F{si,S2) < 
F(A(si), A(s2)) < -F(si,S2)^, which implies that 
^(si,S2) = or 1. In other words, si = S2 or si and 
S2 are completely distinguishable (cf. Corollary |4]). 

Suppose that si = S2, then one has a time evolu- 
tion map A e AiS,S<^S) defined by A(s) := s (g) si. 
(Physically, this is nothing but a preparation of a fixed 



state si.) It is obvious that this jointly clones Si and 
S2- Next, suppose that si and S2 are completely dis- 
tinguishable. Namely, there exists a measurement M — 
{mi, m2) G M{S) such that mi(si) = 1, toi(s2) = (and 
thus m2(si) = 0,7712(52) = 1). Then, A(s) := 777i(s)si (g) 
si -t- 7772 (s)s2 (X) S2 for any s G 5 defines a time evolution 
map A e A{S, S ®S) satisfying the cloning condition 
([5]). (Notice that mi{s),m2{s) > 0,mi{s) -I- 7772(5) — 1 
and thus r77i(s)si (g) si + ?772(s)s2 ^ S2 G S(^S from the 
convexity of S<E)S- The affinity of A follows from the 
affinity of 777.) ■ 

Lemma 1 For any GPT with at least two distinct states, 
there exists two distinct states which are not completely 
distinguishable. 

Proof Let si 7^ S2 E S- Assume that any two distinct 
states are completely distinguishable. Then, we have 
^(■si,S2) = 0. From the convexity of S, there exists a 
state s := ^si + ^S2 ^ si. From the concavity of F, 
we have F(si,s) > ^F{si,si) -f ^F{si,S2) = \- There- 
fore, si and s are distinct states which are not completely 
distinguishable. ■ 
We call a physical process which clones any unknown 
states a universal cloning machine : 

Proposition 8 (No-cloning) In any GPT with at least 
two distinct states, there are no universal cloning ma- 
chine. 

Proof This follows from Theorem [T] and Lemma [T] ■ 
In a usual application, cloning is often considered for 
only pure states. We call a physical process which clones 
any unknown pure states a universal cloning machine for 
pure states : However, such cloning is possible if and only 
if GPT is classical: 

Proposition 9 GPT is classical iff there is a universal 
cloning machine for pure states. 

Proof Notice that classical systems are characterized 
by the fact that all the pure states are completely dis- 
tinguishable Q. This fact and Theorem [T] complete the 
proof. I 

Theorem 2 (Information disturbance) In any GPT, any 
attempt to get information to discriminate two pure 
states which are not completely distinguishable inevitably 
causes disturbance. 

Proof Let si, S2 € Sa be two pure states which are not 
completely distinguishable, i.e., < F{si,S2). Assume 
that there is a physical mean to get information to dis- 
criminate si,S2 without causing any disturbance to the 
system. This implies that we have a time evolution map 
A e A{Sa,Sa ® Sb) and initial states t^ S Sb such that 
the reduced states to system A is the same: 

A(si (g) to) A = Sl, A(S2 ® to)A = S2. 
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Since si,S2 are pure states, there exists no correlations 
between system A an B, and hence one gets 

A(si (g) to) = Sl ® h, A(S2 ® to) = S2® t2, 

for some ti,t2 G Sb- From the monotonicity of F and 
Proposition [6l it follows that F{si, S2) = ^"(51 ® to, S2 ^ 
to) < F(A(si(g)to),A(s2®to)) = F{si,S2)F{ti,t2). Since 
< i^(si,S2), we have F{ti,t2) = 1 and thus ti = ^2- 
Therefore, to get information to distinguish si and .S2, 
one has to inevitably disturb at least one of these states. 
■ 

No cloning theorems are discussed in 0] with com- 
pletely different methods. In [l^, we have proved The- 
orem [5] using the Kolmogorov distance. Essentially the 
same proof as above is given in [T^ . 

V. INDECOMPOSABLE AND COMPLETE 
MEASUREMENT IN GENERAL 
PROBABILISTIC THEORIES 

A. Indecomposable Effect 

In quantum systems, a fundamental POVM element 
E is that with one dimensional range, called a rank-one 
POVM element. Let us define the corresponding notions 
in any GPT, which we are going to call an indecomposable 
effect: 

Definition 1 We call an effect e G £{S) indecomposable 
if (i) e ^ and (ii) for any decomposition e = 61 + 62 into 
the sum of two effects ei and 62, there exists c G R such 
that 61 = 66. We denote the set of all the indecomposable 
effects on S by E.,nd{S) C EiS)- 

It is easy to see that the above mentioned c satisfies < 

6 < 1. 

Here, we show some general properties of effects and 
indecomposable effects: 

Proposition 10 Let e be a non-zero pure effect on S- 
Then, there exists a state s such that e{s) ~ 1. Since S 
is compact, such state can be taken to be a pure state. 

Proof Suppose that there are no state s such that e{s) = 
1. Then, from the compactness of S, we have 

sup6(s) = max6(s) =: a; < 1. 

From this, e := e/x is an effect which is neither e nor 
zero effect 0. Since we have the identity, 

6 = xe -I- (1 — a;)0, 

this contradicts that 6 is a pure effect. 

Let s = J2iPiSi (Pi > 0,J2iPi = 1) be a pure state 
decomposition of s. Then, it is easy to see e{si) = 1 for 
any pure state Si. Thus, we can take a pure state s such 
that 6(s) = 1. ■ 



Corollary 5 Let e be a pure effect which is not u. Then, 
there exists a state s such that e{s) = 0. Such state can 
be taken to be a pure state. 

Proof Since e{j^ u) is pure, the effect e = u — 6 is non- 
zero pure effect. From Proposition 1101 there exits a pure 
state s such that e(s) = 1 — e{s) = 1. Thus, e{s) = 0. ■ 
Next, we show that any non-zero effect has a decom- 
position with respect to indecomposable effects: 

Proposition 11 In any GPT, for every 7^ 6 e £{S), 
there exist a finite collection of indecomposable effects 
Si G £{S), i < i < r, such that e = J2i=i ^i- ^'^ particu- 
lar, in any GPT, there exists an indecomposable effect. 

(See Appendix A for the proof.) Moreover, we have: 

Proposition 12 In any GPT, there exists an indecom- 
posable and pure effect. 

Proof To prove this, we use the following lemmas: 

Lemma 2 Let e €z £ be an indecomposable effect and let 
q := maxsg^ e(s). (Note that < q < 1.) Then, e :~ ^ 
is an indecomposable effect. 

Lemma S If e is indecomposable effect such that there 
exists a state s satisfying e{s) — I, then e is a pure effect. 

(Sec appendix A for the proofs.) From Proposition 
there exists an indecomposable effect. From Lemma [21 
one can construct an indecomposable effect from any in- 
decomposable effect such that e(s) = 1 for some pure 
state. From Lemma [31 it is an indecomposable and pure 
effect. ■ 

In the following, we give a characterization of indecom- 
posable effects in classical, quantum and hyper cuboid 
systems in order: 

[Classical Systems] Let Sc be the state space of a clas- 
sical system introduced in section [TT] Remind that any 
state p ~ {pi, . . . ,pd) G Sc has the unique decomposi- 
tion with respect to pure states: s = ^^Pfj.p'^^^ ■ Thus, 
an effect e on Sc is completely characterized by d num- 
bers of value x^ := 6(p('^') G [0,1] (/z = 1 ,d). 

Conversely, for any given x^ G [0,1] (/i = l,...,d), 
there exists an effect e such that e{p^'^^) = x^. Let 
6^'") G S{Sc) (m = l,---,rf) be the effects defined by 
g(M)(p("^)) — g^^^ classical systems, the indecompos- 
able effect is characterized as follows: 

Proposition 13 An effect e G £{Sc) "is indecomposable 
iff there is one pure state at which the value of effect is 
non-zero. In other words, £ind{Sc) is characterized by 
5md(5c) = {A6('^) I AG (0,l],Ai= l,...,d}. 

[Proof] First, let e be an effect such that there exists 
one pure state, say p^, at which the value of effect is 
non-zero. Then, one has 6 7^ and e = A6^'^' G £iSc) 
for some A G (0,1]. Let 6 = / + 5 for /, g G £{Sc)- 
Then fip'-"^) = for any v ^ ji, and it follows that 
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/ = — -e. Therefore, e is indecomposable. Next, let e 
be indecomposable effect. Assume that there are at least 
two non-zero pure states, say p(^o)^p(Mi) ^ ^-^ — 
l,...,d) at which the effect values are non-zero. Let 
Xfj, := e(p^^'). Let /, g be effects defined by /(p^''^) = 
Xfio6fi^g and g = e — f. Obviously e ^ cf for any c e M, 
and it contradicts that e is indecomposable. Since e ^ 0, 
there is the only one pure state at which the value of 
effect is non-zero. I 
[Quantum Systems] Next, we show that indecompos- 
able effects for quantum systems are characterized by an 
one dimensional projections, i.e., rank-one POVM ele- 
ment. Let Ti be the d dimensional Hilbert space and let 
Sq be the set of all the density operators on 7i. We call 
a non-zero POVM element E indecomposable iff the cor- 
responding effect e(-) := tr(£'-) is indecomposable, ft is 
easy to see that a POVM element E is one dimensional 
iff there exists A S (0, 1] and a unit vector ip G H such 
that E = X\i^){i'\. 

Proposition 14 A POVM element E £q is indecom- 
posable if and only it is a rank- one POVM element. 

Proof Let E — A|V')(V'| be a rank-one POVM element 
with a unit vector ip £ H and A G (0, f ] . Let E = E1+E2 
for some POVM elements £'1, i?2: 

AIV'X^'I =^1+^2. (9) 

Let {ipn}n be an orthonormal basis of H such that 
V'l = ip. Then, from ©, it follows that {^j\Eitpj) = 
||i;y2^^-||2 = (Vj > 2), and hence Ei^Pj = (Vj > 2). 
For any ^ G we have Ei^ £^i(E„('0n|OV'n) = 
{iplOEiip = \Ei'iP){iP\C Thus, El has the form of |0)(V'l 
(where (j) ■= Eiip). Finally, since Ei is Hermitian, it 
follows that there exists c' S M such that (j) = c'lp and 
hence Ei = c'\ip){ip\ = cE where c := y. This implies 
that E is indecomposable. Next, let E be indecompos- 
able. Assume that E is rank I POVM element for some 
I > 2, and let E = Cn|V'n)(V'n| (c„ € (0,1]) be an 

eigenvalue decomposition of Let i?i :— ci\^pi) \ and 

^2 := I]L=2 c«IV'«)('0n|- Obviously, they are POVM el- 
ements satisfying E = Ei -\- E2. However, for any c G M, 
we have E ^ cEi (For instance, £^"02 = C2 7^ while 
cEiip2 = 0). This contradicts that E is indecomposable. 
Since i? ^ 0, we conclude that E is rank one POVM 
element. ■ 

[Hyper cuboid systems] Finally, let Sch be the state 
space of a d dimensional hyper cuboid system introduced 
in section [TTl To determine the indecomposable effects in 
iScb, we present a general lemma which is also useful in 
later arguments (see Appendix [Xl for the proof): 

Lemma 4 // the state space S of a GPT contains at least 
two states, then for every indecomposable effect e G SiS) 
we have e{s) = for some s € S- 

By virtue of this lemma, we obtain the following charac- 
terization of indecomposable effects in Sch- 



Proposition 15 An effect e G £{Sch) is indecomposable 
if and only if it is nonzero and it takes at a d — 1 
dimensional face (facet) of Sch- 

Proof First we consider the 'if part. Suppose that e 
is nonzero and e takes at a facet F of iScb- Fix a 
state s G iScb such that s ^ F. Note that e(s) > 
since e is nonzero. If e decomposes as e = ei -I- 62 with 
ei, 62 G £{Sch), then ei (hence 62) also takes at F. This 
implies that ei = Ae where A = ei(s)/e(s) G R, hence e 
is indecomposable. 

Second, we consider the "only if" part. By Lemma IH 
an indecomposable effect e takes at some state, hence 
at some pure state in Sch- I3y symmetry, we may assume 
without loss of generality that e(so) = where sq = 
(0, 0, . . . , 0) G Sch- Let (i G {1,2,..., d}) be the vertex 
of iScb such that its j-th component is Sij . Then we have 

e = I^iLi e{si)ei where G £{Sch) maps (ci, C2, - - - ,Cd) 
to Ci. Since e is indecomposable, it follows that e = Ae^ 
for some 1 < i < d and A G M, therefore e takes at the 
facet {(ci, ---,Cd) & Sch\ Ci = 0} of Sch- ■ 
For example, the indecomposable effects in the squared 
system (i.e., when d = 2) are listed in Table [H where 
ai, . . . , a4 G (0, 1] are parameters. 

TABLE L Indecomposable effects in Sch, d = 2 







value at 




effect 


(0,0) 


(0,1) 


(1,0) 


(1,1) 


ei 












62 


a2 


Q2 








63 





as 





as 


64 


04 





04 






B. Indecomposable and complete measurements 

Using the indecomposable effects defined above, we de- 
fine an indecomposable measurement in any GPT as fol- 
lows; 

Definition 2 In a GPT with a state space S, we say 
that a measurement M = (jnj)j is indecomposable if all 
TTij G £{S) are indecomposable. The set of all the inde- 
composable measurements is denoted by A^ind(iS) or sim- 
ply by Mind- 

From Proposition 1141 an indecomposable measurement 
is a generalization of a one-rank POVM measurement in 
quantum systems. 

Proposition 16 In any GPT, there exists an indecom- 
posable measurement, i.e., A^ind(5) 7^ 0- 

Proof From Lemma [TTl ^ decomposition of the unit 
effect u with respect to the indecomposable effects gives 
an indecomposable measurement. ■ 
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In quantum systems, rank-one PVM measurement 
plays a fundamental role in the foundation of quantum 
physics, which describes a measurement of a non degen- 
erate Hermitian operator. One can also define the corre- 
spondent notion in any GPT, which we call a complete 
measurement, as follows: 

Definition 3 In a GPT with a state space S, we say that 
a measurement M = {mj)j is complete if all mj e £{S) 
are indecomposable and extreme. The set of all the com- 
plete measurements are denoted by A^comp(5), or simply 
by A^comp- 

It is easy to see that the set of extreme effects for 
classical systems are characterized by 5c) ex = {e € 
£{Sc) I e(p(')) = 1 or (i 1, . . . , d)} [3|. Hence, from 
proposition [T31 there is essentially the unique complete 
measurement in classical systems given by Mcomp ■= 
(e(j))j, where eJ{p^'''>) := Sjk (j, fc = l,...,d). More 
precisely, M = (jnj)j is a complete measurement iff 
rrij = e'^^-'-' where a{j) is a permutation of {l,...,d). 
On the other hand, in quantum systems, a complete mea- 
surement is given by a rank-one PVM measurement. This 
follows from Proposition [T3] and the fact that a POVM 
element is extreme iff it is a projection operator. 

By definition, A^comp(5) C Mind{S)- However, the 
existence of the complete measurements does not neces- 
sarily hold for any GPT (See Appendix [B] for a counter 
example) . 



VI. SOME QUANTITIES RELATED TO AN 
ENTROPY 



In this section, we consider three quantities on S in any 
GPT which are related to the notion of entropy. Indeed, 
all of them coincides with the Shannon entropy H and 
von Neumann entropy S in classical and quantum sys- 
tems, respectively, and therefore give generalizations of 
entropies in classical and quantum systems. However, as 
is shown, they do not coincide in some GPTs. and does 
not satisfy some of properties of an entropy. In the follow- 
ing, let H{p), or simply as H{pi), denote the Shannon 
entropy for a probability distribution p = (pi, . . . ,pd)- 
H{p) := — J2iPi log Pi- We also denote it by H{X) when 
the random variable X are dealt with. The mutual in- 
formation for a random variable X and J are denoted by 
H{X : J) := H{X) + H{J) - H{X, J). In quantum sys- 
tems, the von Neumann entropy for a density operator p 
on Ji is denoted by S{p) := — tr(plogp). 

Let us consider a general GPT with a state space S- 
For any state s G 5, we denote by T){s) the set of all the 
ensembles {px]Sx}x [sx G S,Px > 0,J2xPx = 1) ^'^^^^ 
that s = '^^PxSx- The set of all the ensembles for s 
with respect to pure states are denoted by V{s) C V{s); 
i.e., {px;sx} e V{S) ^ s = J2 xPx^xj -^x G Spure- NotC 
that H and S have a concavity property. Both H and S 
are positive and take the minimum value iff the state 
is pure. The following upper bound of von Neumann 



entropy is also well known: for a probability distribution 
{pi)i and a set of density operators {pi}i, 

SCZ^P^Pi) < H{p^ + y^p,S{pi), (10) 



with equality iff density operators pi are orthogonal to 
each other. See, for instance [1^, for the properties of 
Shannon and von Neumann entropies. 

In any GPT, let us define the following quantities for 
s G 5: 



(11) 



Si{s) := inf H{mj{s)). 

S2{s) := sup sup H{X : J). (12) 

S^{s) := inf H{px). (13) 

In 5*2(5), H{X : J) is defined by a joint distribution 
Px'mj{sx) with an ensemble {px,Sx} G V{s) and a mea- 
surement M = {mj)j. From the definition and the pos- 
itivity of the Shannon entropy and the mutual informa- 
tion, the positivity of 81,32,83 are obvious. It is easy 
to see that 5*2 can be redefined with respect to T>{s) and 
M- 

Lemma 5 We have 

82(3) = sup sup H{X : J) 

Proof A straightforward computation shows that, for 
any {px,Sx} € T>{s) and M = {mj)j £ J\4., the value of 
H{X : J) is not decreased by replacing {px, Sx} with the 
pure state decomposition of s obtained by decomposing 
every Sx into pure states, and by replacing M with the 
indecomposable measurement obtained by decomposing 
every mj into indecomposable effects (cf. Proposition 
[TT|) . This implies the desired relation. ■ 
However, note that it is essential to use Mind and Vis) 
for the definitions of 81 and 83. Indeed, redefinitions of 
81 and 53 with respect to T>{s) and M give trivial quanti- 
ties: mfMeMH{mj{s)) =0, inf{p^,s,}Gl?(s) ^^({Pa;}a;) = 

0. 

Notice that all three quantities (|lip - (fT5|) are defined 
with physical languages: 8i{s) measures the minimum 
uncertainty of measurement among indecomposable mea- 
surements under a state s; >S'2(s) measures the max- 
imum accessible information (by an optimal measure- 
ment) among any preparation of s (See below). Finally, 
£'3(3) measures the minimum uncertainty for a prepara- 
tion of s with respect to pure states. 

Indeed, under the preparation of states Sx with a prior 
probability distribution px, the accessible information 
I{{Px,Sx}) is defined by supM=(™^)^.e7v, H{X : J) where 
the joint probability distribution between X and J (mea- 
surement outcome by a measurement M = {mj)j ) is 
given by p{x,j) := Px^^jisx)- Therefore, from Lemma[5l 
we have 82{s) = sup{p^ I{{px,Sx}), and thus 
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Proposition 17 In GPT, for any preparation of states 
{PxtSx}, the accessible information is bounded as 



I{{Px.S.x})<S2{s), 



(14) 



whe 



Notice that, in quantum systems, the Holevo bound 
[2?! gives an upper bound of the accessible information 
by the Holevo x quantity: For a preparation of density 
operators with a probability distribution p^, 



I{{Px,Px}) < X := S{p) - ^PxS{px) 



(15) 



In the following, we see that S2 coincides with the von 
Neumann entropy in quantum systems. Thus, p4p gives 
a looser bound than the Holevo bound in quantum sys- 
tems. (For the pure state ensemble, (fT4| gives exactly the 
Holevo bound since the von Neumann entropy vanishes 
on pure states.) 

Now, we show that all three quantities pT|) - P^ are 
generalizations of Shannon and von Neumann entropies 
in classical and quantum systems: 

Theorem 3 (i) In classical systems, Si{s), 82(8), S3{s) 
are the Shannon entropy. (ii) In quantum systems, 
51(5), 52(5), 5*3(5) are the von Neumann entropy. 

Proof (i) Let Sc be the state space of a classical sys- 
tem. From Proposition ! 131 any indecomposable measure- 
ment in classical system is given by (A^ 



where 
, d. Thus, 



= 1 for any = 1, 
for a state p = {pi, . . . ,pd) G Sc, the probability dis- 
tribution given by the indecomposable measurement is 
{K,iiPfj.)i,fj.- Note that from the concavity of the func- 
tion g(x) := —xlogx {x G [0,1]) with the convention 
5(0) = 0, it holds that g{\x) > Xg{x), and thus we 
have if(Ai,^p^) = J2i,^9{Ki^Pt^) > J2^(I2i Kp)9{P^i) = 
H{pfj_). Thus, we have Si{p) > H{p). Since (e^'^^)^ is an 
indecomposable measurement with which the probability 
distribution is given by p, we have Si{p) = H{p). 

As mentioned before, a state space of a classical system 
is characterized by a simplex. Thus, we have 

52 (p) - sup H{X : J) 

MEM 

where the random variable X is described by the proba- 
bility distribution p. Remind that the mutual informa- 
tion can be written as H{X : J) = H{X) - H{X\J) 
where H{X\J) denotes the conditional entropy, and it 
follows that 



5*2 (p) = sup H{X 



J) = H{X) 



inf H(X\J). 



Since there exists a measurement M = (rrij) to dis- 
criminate all pure states in a classical system, we have 
infMeM = the uncertainty of X condi- 

tioned on the information of J is zero). Therefore, we 
have S2ip) = H{X) = H{p). 



Again from the unique pure state decomposition, there 
exists the unique ensemble for any state 

p = (pi, . . . ,Pd) e Sc- Therefore, we have 53 (p) = -ff(p). 

(ii) Next, we consider a quantum system described by 
a Hilbert space Tl- First, let / be a concave function on 
[0, 1] such that /(O) = 0, and let p be a density operator 
on 7i- Then, it is easy to show [3^ that for all vector 

&Ti. such that 11-011 < 1, wc have 



fMpm > mipM- 

Let us fix any indecomposable POVM measurement 
{Ej)j on quantum system 7i, i.e., rank-one POVM mea- 
surement. We can write Ej = \'4'j){'>Pj\ with a vector 
■0 e H such that < ll'f/'jil < 1 and |V'j)('0jl = I- 
Remind that the von Neumann entropy of p is defined 
by 

S{p) ■■= tr.g(p) 

with the concave function g[x) := —xlogx with the 
convention g(0) = 0. Applying g to the above con- 
cave function /, we have II{ej{p)) = J2j dii'^jlpl'^j)) — 
EjH^MpM) ~ t^i9{p)E^E^) - tr5(p) ^ S{p). By 
considering the indecomposable measurement given by 
i\(t'j){4'j\)j where arc complete eigenvectors of p, we 
obtain 5i(p) = 5(p). 

Next, from the Holevo bound (fT5l). we have 



52 (p) < Sip) 



inf 



[^PxSipx)) 



Sip). 



The final equality follows from the eigenvalue decompo- 
sition p = J2xP^\4>x){4>x\ and Si\(l)x){4>x\) = 0. Again 
with the decomposition {px,Px = \4'x){4>x\} of eigenval- 
ues and eigenvectors, there exists an optimal measure- 
ment Mj := |0j)(0j| to discriminate px, and thus one 
has HiX : J) = Hip). Since 5(p) = Hip), we have 
Sip)=HiX:J)<S2{p). 

Finally, let {px,Px} G Vip) be a pure state decompo- 
sition of p. Then, from the inequality (|10p and the fact 
that Sipx) = for pure states px, we have 

Sip) < Hiipx}) < Ssip). 

Moreover, an eigenvalue decomposition p = 
J2xP^\'P'>:)i'i'^\ '^^ P gives a pure state decomposi- 
tion such that Px = 1 0a;) (0a; I are orthogonal to each 
other, we have the equality: S^ip) = Sip). This 
completes the proof. ■ 
Notice that the fact that 5i , 5*2 , 53 coincide with the 
von Neumann entropy in quantum systems shows that 
wc have alternative expressions with operational mean- 
ings for the von Neumann entropy. The characterization 
of 5 by 53 has been noticed by Jaynes HI]. Here, we re- 
mark that 5*1 could be defined by the infimum of Shannon 
entropy among not indecomposable measurements but 
complete measurements. Then, it is easy to restate the 
above mentioned proof to show 5i coincides with Shan- 
non and von Neumann entropy in classical and quantum 
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RlL 

Rzu 


RlR 

RiU 


RaB 

RH 


R-iB 
RlR 



FIG. 1: In the squared systems, (1), (2) and (3) show the 
graphs of 5*1, S2 and 53. (4) specifies the region Ril, ■ ■ ■ , Rar- 



systems. However, as we have noticed in Sec. IV B[ there 
exists a GPT where no complete measurements exists. 
This is the reason why we have defined Si among inde- 
composable measurements. 

In order to sec the properties of Si, 82, S3 in a general 
GPT, let us again consider the squared system 5sq- Let 
h{x) := — xlogx— (1 — cc) log(l — x) is the binary Shannon 
entropy: 

Proposition 18 In the squared system, for s — 
(ci, C2) € Ssq, we have 

Si{s) = min[/i(ci),/i(c2)], (16) 
^2(5) =max[/i(ci),/i(c2)], (17) 





k{ci + C2 - 


1) seRiL 




fc(ci) 


s e RiR 




fc(0) 


s e R2U 




fc(ci) 


s e R2B 


fc(c2) 


s e Rsu 




fc(ci + C2 - 


1) seR3B 




fc(c2) 


s e RiL 




fc(0) 


s e RiR 


where k{x) := H{{ 


X, Ci - X,C2- 


- X,l + X — Ci 


the regions Ril, ■ ■ 


■ , i?4_R C Ssq 


are given in 



(18) 



(See Appendix A for the proof.) See graphs of 5*1,52 
and 53 in Fig. [T]-(l)-(3). Moreover, in Ssq, the following 
relations among 51,5*2 and 53 are true: 

Proposition 19 For any s £ Ssq, 

Si{s) < 52 (s) < Ssis) 

(Sec Appendix A for the proof.) 



A. Concavity 

In this section, we consider the concavity properties of 
5i,52 and 5*3. It turns out that 5i is concave on S in 
any GPT, while there exists GPT models where 52 and 
53 are not concave. 

Proposition 20 In any GPT, Si is concave on S- 

Proof Let {px)x (a; = 1, . . . , tti) be a probability distri- 
bution and let Sx & S {x = 1, ■ ■ ■ , m). Then, from the 
affinity of effects rrij and the concavity of the Shannon 
entropy, we have 



PxSx 



= .r , ^"^^ ^(ej(VpxS:.)) 

> inf y^PxH{mj{sx)) 

A/=(m,),eA1.„d^ 

> y^Px inf H{mj{sx)) 



^^VxSl{Sx). 



(19) 



Contrast to 5*1 , 52 and 5*3 does not satisfy the concav- 
ity in some GPT. It is easy to give counter examples but 
it is obvious that concavity does not hold in the squared 
systems from the Fig. [T]-(2) and (3). 

In stead of the concavity, we show the following: 52 
satisfies the following weak concavities: 

Proposition 21 (Weak Concavity) In any GPT, S2 sat- 
isfies the foUowings: 

S2[2_^PxSx) > ^ F-r^, 20) 

X 22xPx^2{Sx) 
52(Ep-^-)>E?''^2(s.), (21) 

X X 

S2C^PxSx) > ^Px^^iSx), (22) 

X ' ' X 

S2{y^PxSx) > ^a-^PxS2{sx), (23) 

^ ^ X 

X 

for any {px, Sx}xex G T>{s) (in Ii20\) . we interpret the 

right-hand side as if S2{px) are all 0). 

Proof To prove this proposition, we use the following 
lemma (see Appendix \K\ for the proof): 

Lemma 6 Let {px,Sx}xgx G V^s), Px 0. Then for 
any value tTx > 0, x € X such that J2x ""^ = I7 we have 

S2iy^^PxSx) > y^^TTxPxS2{Sx) ■ 

X X 

By using this lemma, (|20p is proved by putting tt^, = 
PxS2isx) / {J2x' Px' ^"^(^x')) (here we may assume that the 
denominator of the tt,. is nonzero, as otherwise the claim 
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is obvious); ([21]) is proved by putting tt^ ^ Px] p2|) 
is proved by putting tt^ — 1/|X|; and (^5)1 is proved 
by putting — S^xq where xg € X is such that 

PxoS2{Sxo) = TCLa.^xPxS2{Sx)- ■ 

Proposition 22 /n any GPT, 6*3 satisfies 

m m 

SsC^PiSz) < H{p,) + ^PtS^{s{), 

i=l i=l 

for any Si S {i — ■ ■ ■ , m) and probability distribution 

Proof Let e > be an arbitrary positive number. 
Let s := Yl'^iPiSi and let {p*;s*}j € V{si) be an 
"optimal" decomposition of s,; such that Sz{si) + e > 
H{p)). Since {ptP)}r,j e V{s), we have 53(5) < 
H{I,J) := -J2^jP^Pj^og{p^p)) ^ -X^j-P'logP* + 

E^P^{- EjPj logPp = HiPx) + E,P^H(j^) < H{px) + 

T,iPtS3{st) + e- ■ 
Thus, 53 satisfies the same upper bound pO|) of the 
von Neumann entropy in any GPT. 

B. Measure for pureness 

Since both the Shannon entropy and the von Neumann 
entropy vanishes if and only if the state is pure, they can 
be considered as a measure of pureness. (Note also that 
they take the maximum value iff the state is the maximal 
mixed states.) We show that 5*2 and ^3 has this desired 
property in any GPT, while Si docs not satisfy this in 
general. 

Proposition 23 In any GPT, (i) 82(3) = if and only 
if s is pure, (ii) S'3(s) = if and only if s is pure. 

Proof (i) Let s S 5 be a pure state. Since s is an ex- 
treme point of iS, Vip) has essentially the unique (trivial) 
decomposition: {1; s} with H{X) = 0. Thus, we have 

52 (s) = sup H{X : J) = -ini H{X\J) <Q. 

M={7nj)jeM ^' 

To see the converse, let S{s) = for s G 5 and let s = 
PiSi +P2S2 where pi S (0, l),_pi +P2 — ^ and si, S2 G S. 
Since 6*2(5) = and {px', Sx}x=i.2 G D{s), we have 

H{X : J) 

for the random variable X ~ 1,2 and for any M ~ 
(rrij) e A4. This implies that the joint probability 
p{x,j) :— Px'nij{sx) is a product state, or equivalently, 
the conditional probability p(j|x) := p{x, j)/px =rnj(sx) 
is independent of x (Notice that pi,P2 7^ 0). In partic- 
ular, we have mj(si) = p{j\l) = p(j|2) — mj{s2). Since 
this folds for any effect ruj, we have si = S2 from the 
separating property of states. Therefore, s has only the 
trivial decomposition and is a pure state. 



(ii) Let s € iS be a pure state and thus T'(s) has essen- 
tially the unique (trivial) decomposition: {l;s} where 
H{X) = 0. Thus, we have ^3(5) H{X) = 0. Con- 
versely, let 5*3(5) = 0. Then, for any {px;Sx}x G 7^(5), 
it follows H{X) = 0. Assume that s is not a pure state. 
Then, we have {px,Sx}x e ViS) where pxi,Px2 > 
for some xi,X2 G X. However, this contradicts that 
H{X) — 0. Therefore, s is a pure state. 

■ 

Contrast to 52 and 63, 5i does not satisfy this prop- 
erty. For instance, from 5i(s) = for any state 
s on the boundary (four edges) of 5sq- (See Fig. [I]-(l)- 
Note that s on edges but not on vertexes is not a pure 
state.) In general GPT, we show the foUowings: 

Proposition 24 In any GPT, 5i(s) = implies that s 
is on the boundary of S- 

Proof It suffices to consider the case that S has at least 
two states. To prove this proposition, we use the follow- 
ing two lemmas (see Appendix \K\ for the proofs): 

Lemma 7 Let k,£ > 1 be an integer. Let h{x) = 
— a;logx-. Ifxi,...,xi G [0, 1/fc] andj^j^j = 1; then 
H{x)=j:,Hxj)>logk. 

Lemma 8 For any s e S, the map fs : £{S) x 5 ^ R, 

fs{e,t) ~ e(s) — e{t), is continuous. 

Let s G 5 such that 5i(s) = 0. First we show that 
sup(-g J) /s(e, i) = 1. Let A: > 2 be any integer. Since 
5i(s) = 0, there is an indecomposable measurement 
M ~ (mi)i G A^ind such that II{mi{s)) < h[\/k) 
(< logfc). Then we have mi{s) > 1 — 1/fc for some 
i, as otherwise we have a contradiction as follows: If 
TOig(s) G (1/A:,1 — 1/fc) for some io then we have 
H{mi{s)) > h{mif,{s)) > /i(l/fc); while if mi(s) < 1/fc for 
all i then we have H{mi (s)) > log fc by Lemma[71 For this 
iTii, Lemma |4] implies that there is a state t S such that 
mi(t) = 0. This implies that fs{mi,t) > 1 ~ 1/fc- Since 
fc > 2 is arbitrary, we have supj-^ j-, /^(e, i) = 1. Since 
£{S) X 5 is compact. Lemma H] implies that fs{e,t) = 1 
for some e G £{S) and < G 5, therefore e(s) = 1 and 
e{t) — 0. This implies that e is not constant and s lies in 
a supporting hypcrplanc of S, hence s is on the boundary 
of S as desired. ■ 
Note that, in Ssq, the converse is also true: all states on 
the boundary s satisfy 5i(s) = 0. However, this is not the 
case for any GPT. In particular, one can construct a GPT 
where 5i(s) ^ even for a pure state s. For instance, 
consider a GPT introduced in Appendix [B] with state 
space S C R^, which has the four pure states (0, 0), (1, 0), 
(0, 1), and (2, 2). Then any indecomposable effect in S is 
of the form Ae^ such that < A < 1 and is one of the 
four effects in Table HIl in Appendix IB] This implies that 
for any indecomposable measurement M = (m^) ; G A^ind 
we have TOi(0,0) < 2/3 for all i, therefore 5i(0,0) > 
(see Lemma[7]). Thus, in general GPT, neither directions 
of "s is pure 5i(s) — 0" does not holds in general. In 
the next section, we consider a class of GPTs with fairly 
fine property. 
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VII. PRINCIPLE OF EQUALITY FOR PURE 
STATES AND SYMMETRIC GPT 

In the last part of the preceding section, we considered 
a GPT where for some pure state s it holds that Si (s) > 
(See GPT in Appendix B). However, the structure of 
state space are asymmetric and might be just toy models 
for GPTs. On the other hand, both classical and quan- 
tum systems has a certain class of symmetric structures: 
In particular, there are no special pure states which have 
different properties from another pure states. We call 
this the principle of equality for pure states and can be 
formulated as follows: 

Definition 4 (Equality for pure states) We say that 
GPT satisfy the principle of equality for pure states if 
for any pure states Si,S2 € S, there exists a bijective 
affine map f on S such that S2 = /(si). We call a GPT 
satisfying this property a symmetric GPT. 

It is easy to see that: 

Proposition 25 Classical, quantum, and hyper cuboid 
systems are all symmetric. 

In particular, notice that, in quantum systems for any 
pure states pi = \ipi){ipi\, p2 = \tp2){ip2\, there exists a 
unitary operator U such that p2 = UpiW. 

We show that Si vanishes for any pure states in a sym- 
metric GPT. To see this, we first show: 

Lemma 9 In any GPT, there exists a pure state s such 
that Si{s) = 0. 

Proof Let ei be an indecomposable and pure effect (see 
Proposition [12]) , and let m — ei = 62 + . . . + e„i be an 
indecomposable decomposition of u — ei (see Proposition 
[TT|) . Then, AI = {ej)"^^ is an indecomposable measure- 
ment. From Proposition 1101 there exists a pure state s 
such that ei(s) = 1. Thus, wc have H{ej{s)) = 0, and 

Si{s) = o. m 

Proposition 26 Let S be the state space of a symmetric 
GPT. Then, Si{s) ~ for any pure state s. 

Proof From Lemma [HI there exists a pure state sq such 
that 51(50) = 0. For any pure state s, there exists a 
bijective affine / such that sq = /(s). Let AI = {mj)j be 
an indecomposable measurement such that i/(mj(so)) = 
0. Then, it is easy to see that M := {fhj)j where frij := 
ruj o / is an indecomposable measurement. Therefore, it 
follows that II{ifij{s)) = II{mj{so)) = 0. Thus, we have 
proved that S'i(s) = for any pure state s. ■ 
Therefore, in a symmetric GPT, Si measures a pure- 
ness in some sense. However, as the squared GPT shows, 
the converse of Proposition [2S] docs not holds in general 
even among symmetric GPTs. 



VIII. CONCLUDING REMARKS 

We have discussed some distinguishability measures 
(especially, Kolmogorov distance and fidelity) in any 
GPT. In a similar way of quantum information theory, it 
will be convenient to use these measures in constructing 
an information theory in GPT. Indeed, we have refor- 
mulated no-cloning theorem and information-disturbance 
theorem using fidelity. 

We have also proposed and investigated three quan- 
tities related to entropies in any GPT. All of them are 
generalizations of Shannon and von Neumann entropy in 
classical and quantum systems, respectively. However, 
they are in general distinct quantities, as the squared sys- 
tem gives the example. The concavity of Si in any GPT 
holds while it breaks for ^2 and ^3 in some GPT. S2 and 
S3 provides a measure for pureness, while Si does not. 
However, in a symmetric GPT which satisfies the prin- 
ciple of equality of pure states, it follows that 6*1(5) = 
for any pure states 5. In the attempt to find principles 
of our world, which is described by a quantum system at 
least for the present, we think that symmetric GPTs are 
enough to consider by assuming the principle of equality 
for pure states. However, let us remark here that both 
classical and quantum systems satisfy stronger principle, 
which wc call strong equality for pure states or equality 
for distinguishable pure states which can be formulated 
as follows: 

Definition 5 (Strong equality for pure states) We say 
that GPT satisfies the principle of strong equality for pure 
states if it satisfies the following: Let {si G 5p«re}"=i and 
{ti G Spure}iLi (1st n > m) be two distinguishable sets of 
pure states, i.e., there exists a measurement M = {mi)i 
(N = (ni)i) such that mi(sj) = Sij (ni{tj) = 8ij). Then, 
there exists a bijective affine map f on S such that ti = 
J{si) (i = 1, . . . ,to). 

Notice that the squared GPT is symmetric but does not 
satisfies this strong equality for pure states. (For in- 
stance, consider {(0, 0), (0, 1)} and {(0, 0), (1, 1)}.) It 
might be interesting to consider these kind of stronger 
conditions which classical and quantum systems satisfy. 
In particular, we don't know any principles which makes 
the converse of Proposition to hold. 

Acknowledgment We would like to thank useful 
comments and discussions with Dr. Imafuku and Dr. 
Miyadera. Part of this work is supported by Grant-in- 
Aid for Young Scientists (B), The Ministry of Educa- 
tion, Culture, Sports, Science and Technology (MEXT) 
(No.20700017). 

Note added. Related but independent works for en- 
tropies in GPT has appeared recently in Ref. [3l|, [s^ 
while completing this paper. We will continuously see 
the relations between our work and the results there in 
near future. 
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APPENDIX A: PROOFS OF SOME 
PROPOSITIONS 

[Proof of Proposition [2] Notice that for any mea- 
surement M ~ {mi} G A^(iS'), and any afBne map 
A € A{S,S'), we have another measurement N = {rii} e 
M{S) where rii := mi o A. Let M = {mi} £ A4{S') be 
an optimal measurement which attains the maximum: 

Z?(A(si),A(s2)) = i^K:(A(si))-m,(A(s2))|. 

i 

Then, we have 



[Proof of Proposition [3] Let AI = {mi} e be a 
measurement which satisfies 

D(^YlptSi,Y^q,U^ = ^Yl\mi{J2PjSjymi{Y^qjtj)\. 

i i i j j 

Then, we have 



< 



^^Pj\{mi{sj) - m^{tJ))\ + ^^\{pj- q3)\m^{tj) 



< J2PjD{s,t) + D,{p„q,), 
j 

where we have used (i) afiinity of m^, (ii) triangle in- 
equality of I • I, and (iii) mi = u. I 
[Proof of Proposition [4] The proof goes almost sim- 
ilar manner with that of Proposition [21 only noting a 
technical treatment of the infimum: For any e > 0, 
there exists an "optimal" measurement M = {mj)j such 

that F(A(si),A(s2)) + e > EjV'«j(A(si)K(A(s2)) 
from the definition of the fidelity. By using a measure- 
ment N = {ni)i where := o A, it follows that 

F(A(.si),A(s2)) + e > EjVM^lKls^ > F{.si,S2). 
Since e > is arbitrary, we obtain the monotonicity. ■ 
[Proof of Proposition[5] For any e > 0, let M = {mi)i 
be an "optimal" measurement satisfying 



F(^PiSi,^qiU)+e > ^ mk(^PtS,)mk(^qjtj). 

i i k y i j 

Using the affinity of and the Schwarz inequality be- 
tween vectors {yj ptmk{si))i and {yj qimk{ti))i, one gets 

FC^PiSi.^qiU) + t 



> ^ ^ \/Pi'^k{si)qimk{ti) > ^ y/p^F{si,ti 



Letting e — > 0, we obtain the strong concavity. ■ 
[Proof of Proposition [6] (i) For any e > 

0, let M = (m,), e Mis a) and N = 
{nj)j e AiiSs) be "optimal" measurements such that 
f (si, S2) + £ > S i a/™'('5i)"^j(s2) and F{ti,t2) + e > 
J2j y^^WKTfe), respectively. Then, = (g) 
rij G £{Sa'^ Sb) gives a (joint) measurement G = 
{g^J)^j & M{Sa®Sb), a.nd{F {si,S2) + e){F{ti,t2) + e) > 

J2ij Vdijisi ®ti)gij{s2®t2) > F{si (g) ti,S2<^t2)- 

(ii) For any e > 0, let M ~ {mi)i € be 
an "optimal" measurement such that F[sA,tA) + e ^ 
Y.i ^/mi{sA)mi [tA). By noting that gi m^^UB gives 
a measurement on Sa(S)Sb and gi{s) — mi{sA), gi{t) — 
mi{tA), onehasF(sA,tyi)+e > Y.i ^/ 9ii.s)gi{t) > F{s,t). 

(iii) The inequality F{si,S2) > F{si ® t,S2 ^ t) > 
- y/mi{si) (g mi{s2) follows from (i) and F{t, t) = 

1. To see the opposite inequality, let G = {gi)i G 
M.{Sa® Sb) be an "optimal" measurement such that 
F{si ®t,S2®t) + e> \/ 9i{si ® t)gi{s2 ® t) for any 
e > 0. Then, since mi{s) := gi{s ® t) (Vs € Sa) gives a 
measurement M = {mi)i € Sa: we have F{si ^ t,S2 <S) 
t) + e>J2i Vwi(si) (g) mi{s2) > F{si, S2). 

■ 

[Proof of P rop osition \7\ The proof is essentially the 
same as in [2^. For any e > 0, let AI ~ {mi)i be 
an "optimal" measurement which satisfies F{s, t) + e > 
J2iVPt^ where pi := mt{s),qi m,(t). It follows 
that Ei(v^ - V*)^ T.^P^ + 2^;, ^/p^gI > 

2(l-F(s,i)-e). Noting that I^K " ^ \^/P'^ + V^^l 

we have X;i(V^- V*)^ = ~ V^'HV^^ V?*! - 

EIp^ - 9d < 2D{s,t). Next, let TV = {/J e 7W 
be an optimal measurement which satisfies D{s,t) = 
\ Hi l^'i-SiL where = ni(s), Sj = nj(t). Then, we have 



4 



■■ < 
1-E 



d)' = i(EJ^/^-^AIllV^ + 

■|2\ 



^2^ 



nsi ) (E 
) U + E. 



J2,si + 2EtV^. 



= i-(E. 



< 



k i 



1 — F{s,t)'^, where we have used the Schwarz inequality. 
■ 

[Proof of Proposition 111] In the proof, we use some 
terminology from convex geometry. We say that a subset 
C of a finite dimensional Euclidean space is a cone if 
V & C and A > imply Aw S C (hence G C). We say 
that a closed convex cone G is pointed if C n — C = {0}. 
In the proof of Proposition [Til we use the following fact 
for pointed cones: 

Lemma 10 ((29!, Theorem 3.3.15]) A closed convex 
cone C C M.^ is pointed if and only if there is a linear 
functional f on such that G' = {v £ G \ f{v) ~ 1} 
is compact and satisfies C — {Aw | u G C", A > 0}. 

We proceed the proof of Proposition [Til Put A'' = 
dim(5) < 00 and let 5 C ^ = (recall that now 
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S is finite dimensional). Choose si, . . . , sjv+i S 5 such 
that V is the afBne hull of these iV + 1 points. Then any 
afhne functional on S extends to a unique affine func- 
tional on V, therefore the set Aff J of all nonnegative 
affine functionals f on S can be embedded in V' = R^+^ 
where the z-th coordinate signifies the value at s;. Now 
the embedded image of Aff+ in V is a pointed closed 
convex cone, where the closedness follows since elements 
/ of the set Aff J arc characterized by closed relations 
among the values of / at the points s^. Thus by Lemma 
[TOl there exists a linear functional g on V' such that 
C = {e G AffJ I g{e) ~ 1} is compact and satisfies 
AffJ = {Ae I e e C, A > 0}. Note that C is convex by 
definition. 

Let ^ e E £{S)- Then we have Ae £ C for some 
A > by the property of C. Since C is compact and 
convex, the Krein-Milman's Theorem implies that Ae can 
be written as a finite convex combination Ae = Px^x 
of extreme points Cx of C. Since S is compact, by taking 
a sufficiently small /i > it follows that ex — fJ^Cx € £{S) 
for every x. Moreover, choose an integer fc > such that 
kXn > 1. Then we have a decomposition 

of e into a finite collection of effects (note that < 
Px/{kXfi) < 1). 

Our remaining task is to show that each qxSx, where 
Qx = Px/ikX^), is an indecomposable effect provided 
qx > 0. Let QxCx = e' + e" with e', e" e S{S), e', e" ^ 0. 
Then we have Cx — {(ix[i)^^{E' + e"). By the property 
of C, there exist v' ^v" > and e',e" S C such that 
e' = u'e' and e" = i'"e". We have ex = rfe' + ri"e", 
where ?/ = [qxIJ.)^^!^' > and rj" = {qx^)^^v" > 0. 
Moreover, by the definition of C, we have 

1 = aisx) = v'aie') + n'gi^e') = ?/ + T]" . 
Since ex is an extreme point of C, it follows that ex = 



e" , therefore e' 



ex 



{ly' /{fiqx))qxex- Hence 



qxex is indecomposable as desired, concluding the proof 
of Proposition [TT] ■ 
[Proof of Lemma [2] It is easy to show e is an effect. 
Let e = ei + 62 be an effect decomposition of e. Then, 
e = qei + qe2 is an effect decomposition of e since q < I. 
Since e is indecomposable, there exists c € M such that 
qei = ce, or ei = ce. Thus, e is indecomposable. ■ 
[Proof of Lemma4 [3] Let e = Aei + (1 — A)e2 be a 
convex decomposition of e with A G (0, 1). It is easy to 
see that ei(s) = e2(s) = 1. Since Aei, (1 — A)e2 e £ and 
e is indecomposable, we have Aei = ce for some c G M. 
Applying this to s, we have A = c, and thus ei = e. 
Therefore, e is a pure effect. ■ 
[Proof of Lemma [4] First we show that an indecom- 
posable e is not constant on S- Since S has at least two 
states, the separation property of states implies that a 
non-constant effect / G £{S) exists. If e takes constantly 



c G (0, 1], then the decomposition e = cf + c{u — /) con- 
tradicts that e is indecomposable. Hence e is not con- 
stant. Second, if e does not take at any state, then 
we have e(s) > c for some c > and all s G 5 since e 
is continuous and S is compact. Now the decomposition 
e = CM + (e — cu) contradicts that e is indecomposable. 
Hence e takes at some state. ■ 
[Proof of Proposition [18] First we compute Si{s) for 
s = (ci, C2) G &q- Let M = {m.i)i be an indecomposable 
measurement. To compute 5*1, it suffices to consider the 
case that M contains at most one effect rrii of each of the 
four types listed in Table [H indeed, if m^j and nii^ are of 
the same type (i.e., m^^ = Am^j for some A G R), then 
by replacing the pair of and m^^ with -I- m^^ 
the value of H{mi{s)) is not increased. Thus we may 
assume without loss of generality that M consists of the 
four effects in Table |T] with parameters ai = a2 = a, 
= a4 = P :~ 1 — a for some a G [0, 1]. Now, by 
putting g{x) = — a; log a; we have 



H{m,is)) 

= g{aci) + g{a{l - ci)) + g{l3c2) + g{fi{l 
= g{a) + ah{ci) + g{f3) + (3h{c2) 
= h(a) + ah(ci) + (1 — a)h{c2) . 



C2)) 



Since the right-hand side is concave on a G [0, 1], it takes 
the minimum at either a = or a = 1, hence wc have 
Si{s) min[/i(ci), /i(c2)] as desired. 

Second, we compute S2{s) for s = (ci,C2) G Ssq- Let 
{Px,Sx}x G vis) with Sx = icx,i,Cx,2) and M = {mj)j G 
Mind- Again, it suffices to consider the case that M 
contains at most one effect rrij of each of the four types 
listed in Table |T1 indeed, if rrij-^ and m^v, are of the same 
type (in the above sense), then by replacing the pair of 
TOjj and TOj2 with rrij-^ + rrij^ the value of H{X : J) 
is not changed. Thus we may assume without loss of 
generality that M consists of the four effects in Table U 
with parameters ai = q;2 = a, 03 = 04 = /3 := 1 — a for 
some a G [0, 1]. Now a direct calculation implies that 

H{X : J) 

= h{a) + ah{ci) + Ph{c2) 

- X! P^iHc') + Oih{Cxs) + PKcx^)) 
x£X 

= a{h{ci) - ^pxh{cx,i)) + P{h{c2) - '^PxHcx^)) ■ 

X X 

Since all the pure states {cx,ItCx,2) in 5sq satisfy that 
Cxs e {0, 1} and Cx,2 S {0, 1}, we have H{X : J) = 
ah{ci)+/3h{c2) = ah{ci) + {l — a)h{c2), which is indepen- 
dent of the given decomposition {px, Sx}x of s. This im- 
plies that S2{s) = supx.j H{X : J) = max[/i(ci), /i(c2)], 
as desired. 

Finally, we compute 5*3(5) for s = (ci,C2) G Ssq- By 
the reason similar to the case of Si, to compute 5*3(5) it 
suffices to consider a decomposition {px, Sx}xex € Vis) 
such that all Sx are different pure states. Thus we may 
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assume that X = {00,01,10,11}, soo = (0,0), sqi = 
(0,1), sio = (1,0) and Sn = (1,1). Now by putting 
Pii — p vie have 

PlO = Ci - p, poi = C2 - p, Poo = 1 - Ci - C2 + p . 

In the above expression, we have Px G [0, 1] for every a; if 
and only if p„i < p < pm , where 



Pn 



niax[0, ci + C2 - 1], Pm = min[ci, C2] 



Hence we have 5*3(5) = iiifp„,<p<pM H{px)- Now a direct 
calculation shows that 



dp 



for any p G (pm,PM), therefore H{px) takes the min- 
imum at cither p = pm or p = pM- ^sis) ~ 
min [77 (p^ ) | p=p,„ , (p^; ) | p=pM ] • 

First we consider the case that ci < C2 and ci + C2 < 1 
(i.e., s G i?2(7 or s G R2b), therefore Pm = andpM = ci. 
If p = Pm then we have {px)x = (1 - ci - C2, C2, ci, 0), 
while if p = Pm then we have (j>x)x ~ (1^C2, C2 — ci, 0, ci). 
This implies that 

diffM-m := ii{Px)\p=p^^ - H{px)\p=p^ 

= 5(c2 - ci) + .g(l - C2) - g{c2) - g{l - ci - C2) 



where g{a) = —a log a, therefore 



_d_ 

dc-. 



diff 



M-m 



log 



(1 - C2)C2 



(C2 - Ci)(l - Ci - C2) 



which is now non-negative by the conditions for ci and 
C2. Since diffM-m = when C2 = 1/2, it follows that 
diffM-m < and 53(5) = H{px)\p=p^ when < C2 < 
1/2 (i.e., s G R2b), and diffM-m > and 6*3(5) = 
H{px)\p=pm when 1/2 < C2 < 1 (i.e., 5 G R2u)- Hence 
the expressions of 5*3(5) in (fTSl) for s G R2U and 5 G R2B 
are proved. The claim for the remaining cases follow by 
considering suitable symmetry of the state space iSsq- ^ 
[Proof of Proposition [T9] The first inequality 5*i (5) < 
62(5) is obvious by p6|) and (fT7|) . For the second inequal- 
ity 5*2(5) < 53(5), by symmetry, we may assume without 
loss of generality that 5 = (ci,C2) G i?2C/, i.e., 1/2 < 
C2 < 1 — Ci. This condition implies that h{ci) < h{c2), 
therefore 5*2(5) = h{c2). On the other hand, (|18p im- 
plies that 53(5) — g{ci) + g(c2) + .9(1 — ci — C2), where 
g{x) = —x\ogx. Thus we have 

53(5) - 52(5) = g{ci) + 5(1 - ci - C2) - 5(1 - C2) 

which is a decreasing function of C2 in this range, while 
53(5) — 52(5) = when C2 = 1 — ci. This implies that 
53(5) > 5*2(5) for any 5 G -R2C/, hence the claim holds. ■ 
[Proof of Lemma[6] For each x £ X, let {q^, ty}y^Y^ G 
Visx) and Mx = (™j)j6Jx G M.- Then we have 
{Pxqy,tl}xex.yeY^ G vts) and M' = {■Kxrrv^j)x(zx,j&j^ G 
A^. Let Z = {{x,y) \ x e X,y £ Yx} and K = {{x,j) \ 



X G X,j G Jx} denote the index sets of these ensembles, 
respectively. Then, by putting h{a) = —a log a we have 

H{Z : K) = H{K) - H{K \ Z) 
= J2 K-^xm^j{s))- px'q^'hiTTxm^itl')) 



{x,j)eK 



{x\y)eZ 

ix,j)eK 



= m^{s)h{TTx) - Y Px'qy'm^{tf)h{TTx) 
(x,3) {x',y),{x,j) 

+ J2 T^xhim^is)) - Y Px'qy'nxhim]{tf)) 

(x',y),(x,j) 

= (1- ^ Px'ql)Yh{^-) 
{x',y) X 

+ ^ Ihim^is)) Y P-^'qyHm^tl')) 
= ^ TT, ( /.(m,-(5)) - Y Px'q^y'Hm^itl')) 

Since m^{s) = Pxm'^{sx) +J2{x',y)eZ:x'^xP^"lym^itv') 
and h{a) is concave on a G (0, 1), we have h{mj{s)) > 

PxHm^isx)) +J2(x',yy,x'^xP^"ly'Hm^ity)), therefore 

HiZ : if) > ^ TT, lpxh{m^^{sx)) - ^ P^fyKm^M)) 
(■■rj) \ y^y^ 

= Y T^xPxH{Yx : Jx) ■ 

xex 

By taking the suprcmum over all {qy,ty}yi:Y^ and Mx, 
X E X (see Lemma [5]), it follows that 

52(5) > snp H{Z : K) > ^ '!TxPxS2{,Sx) ■ 

x£X 

Hence Lemma [5] holds. ■ 
[Proof of Lemma [7] We use induction on the number 
N of indices j such that Xj ^ {0, l/fc}. The claim is triv- 
ial if = 0, while it cannot happen that = 1. We as- 
sume N >2, and 0<a;i<a;2<l/fcby symmetry. Now 
ifa-i-|-a;2 < 1/A:, then we have /i(xi)-|-/i(x2) > h{xi+X2), 
therefore H{x) > Hijj) where y = (xi + 2:2, 0:3, . . . , a;^). 
On the other hand, if Xi + X2 > 1/fc, then we have 
h{xi) + h{x2) > h{xi + X2 — 1/fc) + /i(l/fc), therefore 
H(x) > H(y) where y = {xi + X2 ~ 1/fc, 1/fc, ^3, . . . , Xi). 
In any case, we have Hly) > logfc by the induction hy- 
pothesis. Hence H{x) > logfc as desired. ■ 
[Proof of Lemma [8] Choose ti,t2, ■ ■ ■ ,tn G S (n = 
dim 5 + 1) such that these are afhne independent. Then 
any element t oi S has a unique expression t = Xiti + 
■ ■ ■ + Xntn, J2j = 1- 'P '■ S R" denote the map 
t (Ai, . . . , A„). Since 5 is a topological subspace of a 
finite-dimensional Euclidean space, S and ip{S) C M" are 
homeomorphic via ip. By identifying S with ip{S) in this 
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way, the map fs is written as /s(e; Ai, . . . , A„) = Aie(ti)+ 
• • • + A„e(<„). This implies that fs is continuous, since 
both (e; Ai, . . . , A„) i-^ eitj) and (e; Ai, . . . , A„) i— > Xj are 
continuous. ■ 



APPENDIX B: GPT WITHOUT COMPLETE 
MEASUREMENTS. 

Here we give an example of a GPT that has no com- 
plete measurements. First note that any nonzero extreme 
effect e takes 1 at some state, as otherwise we have a 
nontrivial expression e = c(c~^e) + (1 — c)0 as a convex 
combination of effects, where c = sup^^^ e(s) G (0, 1). 

We consider a GPT with state space S which is the 
convex huU of four points (0,0), (1,0), (0,1), (2,2) in 
M^. Then by the above observation and Lemma |4l each 
indecomposable extreme effect e takes at an edge of S 



and takes 1 at some state (precisely, at the vertex of S 
farthest from the edge). Thus there are four indecompos- 
able extreme effects in total, as listed in Table HH Now 
it is obvious that no complete measurements exist in the 
GPT, since the sum of the values of indecomposable ex- 
treme effects at the state (0,0) cannot equal to 1. 



TABLE II: Indecomposable extreme effects in Appendix [B] 







value at 




effect 


(0,0) 


(1,0) 


(0,1) 


(2,2) 


ei 








1/2 


1 


62 





1/2 





1 


63 


2/3 





1 





64 


2/3 


1 
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d+l d+1 
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